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Who am I? 


eVK Cloud Storage Architect 
*Tarantool Architect 
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Who am I? 


*VK Cloud Storage Architect 

¢Tarantool Architect & PM & Evangelist 

e| use queues since 2008 

“| like to create queues (esp. with Tarantool) 
“| like to talk about queues 


What are queues for? 


e Task distribution 


Task distribution 


Task distribution 
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Task distribution 


What are queues used for? 


e Execution scheduling 


Execution scheduling 


Execution scheduling 


What are queues used for? 


Fair resource allocation 


What are queues used for? 


14:40—15:30, Today, 03 Hall 


Fair threaded task scheduler 


e Fair resource allocation AEE TLA 


Vladislav Shpilevoy 
(Ubisoft) 


What are queues used for? 


Messages replication 


What are queues used for? 


*Fault tolerance, durability, delivery guarantee 


What are queues used for? 


eMicroservices communication 


What are queues used for? 


*Event-driven architecture (Event Sourcing) 
“Streaming architecture 


Where are queues used? 


Hardware IRQ 
NCQ 
Hardware Buffers 


Where queues are used? 


€0S Kernel epoll / kqueue 
networking 
signal handling 


Where queues are used? 


“Applications — thread 


Where queues are used? 


e Network interactions 


Where queues are used? 


e Distributed systems 


Where queues are used? 


*eConnection of businesses 


Where queues are used? 


“In fact — everywhere 


Where queues are used? 
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“In fact — everywhere Queues are the glue@ 


What is a queue? 


“A media for communication using messages 


What is a queue? 
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What is a queue? 


*One-to-one approach (Put/Take): 1 > 1 


Put/Take 


What is a queue? 


One-to-many approach (Publish/Subscribe): 1 > * 


What is a queue? 


eSynchronous flow (Request/Response): 1S 1 


What is a queue? 


e Protocols: AMOP, MOTT, STOMP, NATS, ZeroMQ, ... 


What are the options? 


e Cloud solutions 
e Amazon SQS, EventBridge 
e Google Cloud Tasks 
e CloudAMQP 


What are the options? 


e Message brokers 
e RabbitMQ 
e Apache Kafka 
e ActiveMQ 
e Tarantool Queue 
e NATS 
e NSQ 


What are the options? 


*Database-backed queues 
*PgQueue 
e Tarantool 
e Redis 


What are the options? 


14:40—15:30, Tomorrow, 01 
Hall 

Transactional queues 

in PostgreSQL 


Igor Loban 

(Toloka.ai) 

*Database-backed queues IO 
*PgQueue 
*Tarantool 


e Redis 


What are the options? 


e“Sockets on steroids” 
e NATS, ZeroMQ 


Who are the players? 


“Apache Kafka 
* Distributed message log for streaming 
*RabbitMQ 
e Traditional broker with AMQP protocol 
Managed Cloud Queue (SQS/EventBridge/MQ/...) 
e The solution to be used in clouds 
“NATS 
e Microservices communication bridge 
*Tarantool 
*The platform for custom built solutions 


Apache Kafka ei 


*Replicated & sharded message log 
Strict delivery order (FIFO) 
SA limited number of consumers 
eStream replay 
e Apache ecosystem integration 
e Mainly used for 

*Data analysis. Logs, metrics, audit 


e High performance stream data processing 
*Consistent data replication 


Apache Kafka 


11:10—12:00, Tomorrow, 01 
Hall 
Kafka architecture: perormance 


Alexander Sibiryakov 
(Zyte) 


15:50—16:40, Tomorrow, 02 
Hall 

Kafka for Golang delevelopers: 
tips and tricks 


Denis Filippov 
(Coins.ph) 


RabbitMQ UE 


e Protocols: AMQP, MQTT, STOMP 

*Prioritized, delayed and background tasks 
Unlimited number of consumers 

Flexible storage: memory, disk, replication, quorum 
e Easy to learn. Hard to master 


e Mainly used for 
e Microservices communication 
e As a message bus 
e As a traditional pub/sub broker 


Managed Cloud Queue Rey 


e Reliable and scalable queue 

*Stateless protocol, simple interaction 

“Standard API 

*Pay-as-you-go: low consumption — low expenses 


e Mainly used for 
Communication between cloud components 
*Serverless architecture 


NATS Messaging 


*Fast non-persistent message communication 
“High performance and scalability 
*Multi-paradigm: pub/sub, put/take, req/res 
*Expandable with JetStream 

e Streaming support 

e Durable storage, RAFT cluster 
e Mainly used as 

e Messaging media for distributed systems 


M 


Tarantool 


e High-performance broker (Tarantool Queue) 
“Integration with streaming systems 

“The platform for building custom queues 

e Arbitrary logic and event processing 

e Transaction support within exchange 

e Mainly used for 


e High performance and high throughput message delivery 
* Building complex event processing with custom logic 


Possible problems 


Or what IS worth to know? 


e Algorithms of queues 

Network problems 

*e“Exactly once” 

e Network (again) and disk problems 
*Outages & failures 


Algorithms 


*FIFO, 
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Algorithms 
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Algorithms 


e Message prioritization 
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Algorithms 


e Nested queues, hierarchy 


Algorithms 


e Retry, delay, retry with delay 


Algorithms 


*Dead letter queue (and reordering) 


Algorithms 


e Task dependency 


Algorithms 


TIL, TTR, Putback 


Would you like yet more? 


*Prioritization and Starvation 


Would you like yet more? 


e Throughput 
Performance 


Would you like yet more? 


e Throughput 
e Performance 
e Scalability 


Would you like yet more? 


Limits and Capacity 


Would you like yet more? 


*Reliability 


Would you like yet more? 


e Reliability + Availability + Durability 


What's next? Network 


Undefined behavior 


Two Generals’ Problem 
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Two Generals’ Problem 


There are two complex 
problems in distributed 
systems: 

2. Exactly once delivery 
1. Strict message order 
2. Exactly once delivery 


Solution for “exactly 
once” 
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Solution for “exactly 
once” 


Solution for “exactly 
once” 
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Problems: network & disk 


e Throughput 
e Latency 


Outages & failures 


Hardware failures 
e Disk 
e Host 
e Datacenter 


Outages & failures 


“Temporary failures 
e Power 
e Network 
e Split brain 


Outages & failures 


*Permanent failure 
e Physical destruction 


Outages & failures 


*Permanent failure 
e Physical destruction 


Reliability explained 


e Availability 


e Durability 
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e Availability 
* Ability to receive messages 
* Ability to store messages 


e Durability 


Reliability explained 


e Availability 
* Ability to receive messages 
* Ability to store messages 
e Durability 
* Ability to not lose received messages 
e Delivery guarantee 


Main queue properties 


e Availability 
e Ability to receive & store messages 
e Durability 
* Ability to not lose received messages 
e Delivery guarantee 
e A promise to deliver 
““At least once” (X = 1) 
“At most once” (X = 1) 
e “Exactly once” is impossible, while stated by many” 


Main queue properties 


e Availability 
e Ability to receive & store messages 
e Durability 
e Ability to not lose received messages 
“Delivery guarantee 
SA promise to deliver “At least once” or “At most once’ 
e Scalability 
* Ability to increase throughput by adding nodes 
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Topology: single instance 


dem -0 


Topology: single instance 


Sake 


RabbitMQ 
Redis 
ActiveMQ 
Beanstakld 


Topology: single instance 


Eka 


Availability: low 
Durability: low 
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Topology: single instance 
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Scalability: no Availability: low 
Guarantee: X < 1, X => 1 Durability: low 


Topology: single instance 
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Scalability: no Availability: low 
Guarantee: X < 1, X => 1 Durability: low 


Topology: multi-instance 


Scalability: yes 


Multiple queues, put to 1 


Scalability: yes Availability: high 


Multiple queues, put to 1 


Scalability: yes Availability: high 
Guarantee: X < 1, X => 1 Durability: medium vê 
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Multiple queues, put to 
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Availability: high 
Durability: high 
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Multiple queues, put to 
K/N 


“Multiplication” 
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Guarantee: X = K Durability: high 


Replication 


Interaction Replicas 
with a leader are on standby 


Replicated queues, 1/N 


“like a database” 


see Ste, 


Availability: high 
Durability: high 


Replicated queues, 1/N 


“like a database” 


Scalability: yes Availability: high 
Guarantee: X = 1 (X = 1) Durability: high 


“Like a database”: quorum 


Quorum writes give protection from data loss 
Quorum guarantees consistency, X > 1 (X = 1) 
Most reliable. But slow. 


Quorum queue cluster 


Guarantee: X = 1 (X = 1) Durability: high 


Quorum queue cluster 


Availability: limited 
Guarantee: X = 1 (X = 1) Durability: high 


Quorum queue cluster 


Availability: limited 
Guarantee: X = 1 (X = 1) Durability: high 


Quorum queue cluster 


Scalability: no Availability: limited 
Guarantee: X = 1 (X = 1) Durability: high 


Replicated queue, 1/N 
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Quorum queues, 1/N 


Scalability: yes Availability: high 
Guarantee: X = 1 (X = 1) Durability: high 


Quorum queues, 1/N 


Scalability: yes Availability: high 
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Quorum queues, 1/N 


Scalability: yes Availability: high 
Guarantee: X = 1 (X = 1) Durability: high 


What else to know? 


e Protocols & limitations 
e Monitoring & maintenance 
e Best choices 


Protocols & limitations 


Message state 
is bound to connection 


“Low latency 
*Immediate requeue 


Hard to scale 
* Task lifecycle 


Stateless 
(HTTP/REST/SQS) 


*Scaling 
*HTTP-balancing 


e Autorelease IS a Must 


Monitoring & maintenance 


*Queue capacity and consumption 
“There is always a limit. 


Monitoring & maintenance 


*Queue capacity and consumption 


eTiming 
e Full event cycle (QoS) 
e Consumer execution time 


Monitoring & maintenance 


*Queue capacity and consumption 


e Timing 


e Quantity of retries, losses and failures 


Monitoring & maintenance 


*Queue capacity and consumption 


e Timing 


e Quantity of retries, losses and failures 
e Message flow 


Monitoring & maintenance 


«Duplicate your messages to logs! 


Maintenance: plan a 
fallure 


“Set up failure policies 


Maintenance: plan a 
fallure 


“Set up failure policies 
e Stop accepting new messages 
*Discard/drop old messages 
e Save the survivors 


Maintenance: plan a 
failure | 
Our greatest glory is 


“Set up failure policies Me her oi 
*Stop accepting new messages But in rising 
*Discard/drop old messages every time we fall. 
e Save the survivors 


Maintenance: plan a 


“Set up failure policies mel ana Du 
"Stop accepting new messages But in rising . 
*Discard/drop old messages every time we fall. 


eSave the survivors 


“Have a plan for a fall down 
eTo use It to get up 


What If & how to? 
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What If & how to? 


“Event loss tolerance 
“Simple message delivery 
“High throughput, scalability 


What if & how to? N 


“Event loss tolerance 
“Simple message delivery 
“High throughput, scalability 


e NATS 
e NSQ 
e ZeroMQ 


What If & how to? 


“To give it a try 


What If & how to? 


“To give it a try 


“SOS (Simple Queue Service) 
Amazon, GCP, ... 
e CloudAMQP 


eSimple brokers: RabbitMQ, NATS 
(watch out for durability and availability) 


What If & how to? 


“For cloud applications 
*For microservices 
For serverless architecture 
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What if & how to? Rey 


“For cloud applications 
*For microservices 
For serverless architecture 


“SOS (Simple Queue Service) 
Amazon, GCP, ... 


Amazon EventBridge 


What If & how to? 


“Set up a streaming architecture 
“High requirements for durability 
e Strict FIFO 


What if & how to? ei 


“Set up a streaming architecture 
“High requirements for durability 
e Strict FIFO 


“Apache Kafka 
e NATS JetStream 
¢Tarantool 


What If & how to? 


Complex processing scenarios and chains 
*Delayed processing, rescheduling 

e Arbitrary topologies 

e Dependent events 


What If & how to? 


Complex processing scenarios and chains 
e Delayed processing, rescheduling 

e Arbitrary topologies 

e Dependent events 


e RabbitMQ 
e Tarantool Queue / Tarantool 


Invitation to KTS Stand 


Please join to our tech talk Today, 14:20 
about queues and Discussion zone 
Tarantool 


die 


Please, leave your feedback! 
Slides: Feedback: 


https://bx.vc/34e3l2 https://bx.vc/lkmguh 
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